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The present paper studies the limiting behavior of the average 
score of a sequentially selected group of items or individuals, the 
underlying distribution of which, F, belongs to the Gumbel domain 
of attraction of extreme value distributions. This class contains the 
Normal, Lognormal, Gamma, Weibull and many other distributions. 
The selection rules are the "better than average" (/3 = 1) and the 
"/3-better than average" rule, defined as follows. After the first item 
is selected, another item is admitted into the group if and only if 
its score is greater than /3 times the average score of those already 
selected. Denote by Y k the average of the k first selected items, and 
by Tfe the time it takes to amass them. Some of the key results ob- 
tained are: under mild conditions, for the better than average rule, 
Yfc less a suitable chosen function of log A: converges almost surely 
to a finite random variable. When 1 - F(x) = e~^ a+h ^ , a > and 
h(x)/x a o, then Tk is of approximate order k 2 . When /3 > 1, the 
asymptotic results for Yk are of a completely different order of mag- 
nitude. Interestingly, for a class of distributions, Tk, suitably normal- 
ized, asymptotically approaches 1, almost surely for relatively small 
P > 1, in probability for moderate sized ft and in distribution when 
/3 is large. 

1. Introduction and summary. Individuals are observed sequentially. The 
problem of whether to accept an individual at the time that she is observed 
has a rich literature. The most celebrated version is the "Secretary Prob- 
lem," where the criterion is to select one individual and the objective is 
to maximize the probability that the best individual is chosen. This set- 
ting has been extended in various ways including selecting a limited number 
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of individuals and basing the reward on the rank or score of individual(s) 
selected. 

Another extension that has received recent attention is to select a group 
of "quality members." This might occur when a team of highly qualified 
professionals is assembled, for example, in an academic department or a 
consulting group in a specialized area. The goal is to find good rules for 
either accepting or rejecting each additional individual into the group at the 
time that the individual is observed. 

One such rule that has been studied is to add a new member to the 
group only if this will not decrease the average quality of the group, termed 
in the literature as the "better than average selection rule." This tacitly 
assumes that "quality" is measurable. A generalization of the rule would be 
to only admit a new member whose score is say 5% higher than the current 
average. We term the extended rules as "/3-better than average rules." These 
rules reduce to the better than average rule when ft = 1 , first considered by 
Preater [5], but allows say for ft = 1.05 to produce a group that is even more 
progressively selective than when (3 = 1. 

The assumption that is commonly made is that the quality of the individ- 
uals are mutually independent from a common distribution. As the horizon, 
n, tends to infinity we study the asymptotic behavior of the average quality 
of the group and the rate at which the group grows for the /3-better than 
average rules. 

The /3-better than average rules are considered in Krieger, Pollak and 
Samuel-Cahn [4], and the present paper, which can be read independently, 
can be considered its natural continuation. Sequential selection of a "good" 
group, based only upon the relative ranks of the observations is consid- 
ered in Krieger, Pollak and Samuel-Cahn [3]. It should be noticed that the 
rules considered here can be implemented without knowledge of the under- 
lying distribution, though their asymptotic behavior depends strongly on 
that distribution. For convenience, we assume that the first item is always 
selected. However, all asymptotic results remain correct if the selection pro- 
cess is adopted only after a core group of members already exists. Also, the 
random variable is assumed to be nonnegative (or the process begins with 
the first nonnegative observation), because negative averages multiplied by 
ft > 1 provide a lower level for inclusion. 

Two quantities are of interest: the average quality, Yk, of the group, after 
k items have been retained, and Tj., the time (in terms of the number of 
observed items) it takes to amass a group of size k. Our interest is in the 
asymptotics of these quantities, as k— >oo. This paper, unlike [4], consid- 
ers F belonging to the extreme value domain of attraction of the Gumbel 
distribution exp{— e~ x } only. Write 1 — F(x) = exp{— H(x)}. Emphasis is 
given to a subset of these distributions, which are also "stretch exponential" 
distributions, where H(x) = x a + h(x), with h(x)/x a — > 0, for all x > xq, 
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for some xq, where a > 0. This class includes the Gamma and Normal dis- 
tributions as particular cases. 

The "expected overshoot," f(x) = E(X — x\X > x), plays an essential 
role. Our main findings are: for the "better than average" rule (/3 = 1), 
under some mild conditions, the quantity Y k — G~ 1 (logk) converges a.s. to 
a finite random variable where G(x) = f^ Q 1/ f(u) du. These mild conditions 
are satisfied in particular by the stretch exponential distributions with a > 1 . 
It is easy to show that the functions G(x) and H(x) are close to each other 
in that G{x) = (1 + o{l))H(x). The convergence of Y k — G~ 1 (logk) is shown 
in Section 3, where also the convergence of the sequence of expected values 
and variances of {Y k — G~ 1 (log/c)} is established. The behavior of Y k for 
/3 > 1 is very different. In Section 4 we show, under mild conditions, that for 
P > 1, Y k jk^~ x converges a.s. to a finite positive random variable. 

The behavior of T k is discussed in Section 5. It is shown that for stretch 
exponential random variables with a > 0, (3 = 1 and every e > one has 
T k /k 2 ~ £ oo a.s. as k ->• oo, while T k /k 2+£ ->■ a.s. When a = 1, T k /k 2 
converges to a finite positive random variable. The "standardized" variable 

fc-i 

for /3 > 1 is considered and has a very interesting behavior for the stretch 
exponential with a > 0. For different values of /3 we obtain different asymp- 
totic behavior: we show that for 1 < /3 < 1 + l/2ct the random variable T£ 
converges to 1 a.s. For 1 + l/2a </3<l + l/ait converges to 1 in probabil- 
ity. For (3 > 1 + 1/a the random variable T£ converges in distribution to an 
exponential mean one distribution, while for f3 = 1 + 1/a the convergence 
in distribution is to a sum of conditionally independent exponential random 
variables. We conclude with Section 6, which contains further comments and 
remarks. Section 2 contains some preliminaries. Proofs are relegated to the 
Appendix in order to highlight the results in the paper. 

2. Mathematical preliminaries. The observations are denoted by X\ , X2, ■ ■ ■ 
and are i.i.d. random variables from a common absolutely continuous dis- 
tribution F. We assume that 1 — F(x) > for all x < 00, unless stated 
otherwise. 

The behavior of rules will be characterized by considering two quantities: 

• T k = The number of observations inspected until the kth item is retained 
(including that item). 

• Y k = The average score of the first k items that are retained. 

The f3 better than average rule is defined as follows: for fixed f3 (which 
is suppressed in the notation) and T k defined above as the number of items 
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observed until the fcth item is selected, let T\ = 1 and Y\ = X\. Define T k 
and Y k inductively by 

T k+1 = M{i >T k :X l > 0Y k }, k = 1, 2, . . . , 
Yk+i = Xx k+1 , k = 1, 2, — 

It is clear that Y k increases in k for (3 = 1. If j3 > 1 we assume nonnegative 
Xi to avoid the situation that if Y k is negative then the cutoff to retain an 
observation becomes less stringent. 

2.1. Theorems on almost sure convergence. In this subsection, first we 
present two theorems, that exist in the literature, which will be useful in 
proving asymptotic results for the quantities of interest. First, we shall need 
the following result, due to Robbins and Siegmund [6], quoted as follows: 

Theorem 2.1. Let (0, J 7 , P) be a probability space and T\ C Ti C • ■ • a 
sequence of sub-a -algebras of J 7 . For each n= 1,2,..., let z n ,j3 n ,^ n and Q n 
be nonnegative T n -measurable random variables such that 

E(z n \F n ~i) < Z n -l{l + P n -l) + in-l — Cn-l- 

Then lin^^oo z n exists and is finite and Y^=i Cn < oo a.s. on -Q^^Li Pn < 

°°> E^°=l£n<00}. 

Corollary 2.1. Let z n ,(3 n ^ n and Q n be nonnegative sequences of con- 
stants such that fin an d converge, and 

Z n < Z n -i(l +P n -l) + £n-l ~ Cn-l- 

Then lmin-j.oo z n exists and is finite and EnLi Cn <oo. 
Proof. This follows trivially from Theorem 2.1. □ 
We also need the following theorem that appears in Feller [2], page 239. 

Theorem 2.2. Let Qi, Q2, ■ ■ ■ be independent r.v.s with E(Q n ) = 0, and 

let S n = Y" =l Qi- If: 

(1) 61 < 62 < ' • ' - > 00 ar e constants and 

(2) En=iE(Ql/bl)<oo 

then b~ 1 S n — > a.s. as n — > 00. 
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2.2. Classes of distributions. Preater [5] showed that when F is exponen- 
tial and P = 1, Yf. — log A; converges almost surely to a Gumbel distribution. 
Krieger, Pollak and Samuel-Cahn [4] extended this result in several ways. 
The asymptotic behavior of other quantities, such as Tk, were obtained, val- 
ues of j3 > 1 were considered and other F, such as the Pareto and Beta, were 
analyzed. 

An interesting question is how the rules behave for other distributions 
F. This depends on the behavior of the overshoot, X — a\X > a, and its 
expectation /(a), 

(2.1) f(a):=E(X -a\X>a). 
Let xf = sup{x : F(x) < 1}. 

Definition 2.1 (See [7], Section 1.1). A distribution function F, belong- 
ing to the domain of attraction of the Gumbel extreme value distribution 
A(x) = exp{— e~ x }, — oo < x < oo, is called a Von Mises function (VM) if 
there exists xq such that for xq < x < xf and r > 

(2.2) l-F(x) = re W ^- Jjl/f A (u)]du^ :=e~ H W, 

where /a(^) > 0, xq < u < xp and /a is absolutely continuous on (xq,xf) 
with derivative /^(it) and \iva. U/ x XF f'^iu) = 0. 

Note that Definition 2.1 is identical to definition (1.3) in Section 1.1 of [7], 
except that we refer to the auxiliary function by /a instead of / to distin- 
guish it from the expected overshoot function. To link the two functions, we 
define g(u) = f(u)/fA(u)- The representation of a VM distribution function 
F in (2.2) is equivalent to 



e -tf(«). 



(2.3) 1 - F(x) = r expj - jf [g(u)/f(u)} du} : 

It is shown in [7] that a twice differentiable VM distribution satisfies 
(2 4) F»(x)(l-F(x)) _ 

It can be shown that \im u y XF g{u) = 1 follows from (2.4). 
Let G(x) be defined by 



(2.5) e- G(x) =rexp<| - / [l/f{u)]du\. 

Thus, 



XO 
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Note that 

(2.7) ±G- 1 (z) = f(G- 1 (x)). 

ax 

It is clear by this definition that H(x) = (1 + o(l))G(x) as 1/ xp. 
We shall consider only such VM for which xp = oo. (But see Remark 3.1.) 
Some of our results that will follow hold for VM distributions, but most 
of the results pertain to a rich subclass. Specifically: 

Definition 2.2. F is a generalized stretched exponential distribution if 
it is VM with H(x) = cx a + h(x), h"(x) exists and c> 0, a > are constants 
where 

h(x) 

(2.8) lim = 
and 

(2.9 lim —^4 = 0. 

This class of distributions is denoted by Q a . By change of variables y = 
^-l a x it suffices in the sequel to consider only c = 1. 

The reason for extending the stretched exponential by adding h(x) is 
to include many of the classical families of distributions such as Normal, 
Gamma, Lognormal and Weibull. For example, the right-hand tail probabil- 
ity of the standard normal behaves like (fi(x)/x by Mills' ratio where <p{x) is 
the standard normal density. Hence the standard normal belongs to Q2 with 
h(x) = log(x). 

3. Average, when (3 = 1. In this section we consider the behavior of 
the average after k items are retained, using the better than average rule. 
The emphasis is on the random variables that are generated from a VM 
distribution. In the first subsection, we consider the almost sure behavior, 
and in the ensuing subsection, results for the expectation and variance of 
Y \ are presented. 

Let Zk = Yk — Yk-i, the "overshoot" over Y^-i - The results are based on 
the following relationship: 

(3.1) Y k - r fe _i + — - + — - — , 

where Z{a) is distributed like X — a\X > a. 

The results depend on the expected overshoot f(a) = E[Z(a)\. We shall 
use the following lemma later, that gives the expected overshoot and squared 
overshoot for F in Q a for large values of a. Specifically: 
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Lemma 3.1. If the underlying distribution is in Q a , a > then 



(3.2) 




and 



(3.3) 



EZ 2 (a) 



= 1. 



a^oo 2a 2 ( 1 " Q ) I a 2 



The proof of the results uses 1'HopitaPs rule on E{Z{a)) = / °°(1 — 
Fz(a){y)) dy for the expected overshoot and E(Z 2 (a)) = 2 y(l — F z ^(y)) dy 

This result implies that f(a) = a}~ a /a[l + o(l)]. In some instances we 
need a more refined result on the rate, that is the o(l) term, which depends 
on h(x). An easy case, as shown in the proof of Corollary 3.1, is when 
h(x) = 0, in which case the rate of o(l) is l/a a . 

In the more general case, we want to include h(x). The point of adding 
h(x) is to extend our results to known distributions such as the Normal. The 
role that h(x) plays, is that it is small relative to x a . 

The following lemma provides a handle on the overshoot. 

Lemma 3.2. If F~Q a , then 



Furthermore, if h'(x)/{x a 6 1 ) goes to 0, where < e < a, we have that 



These conditions on h{x) and its derivatives are hardly restrictive as the 
intent is for h(x) to be small. In particular, if h(x) is x 7 for 7 < a, then all 
of the above conditions hold. 

The details of the proofs throughout this and the remaining sections of 
the paper appear in the Appendix. 

3.1. Results on almost sure convergence of the mean. The main result 
in this subsection is that under mild conditions Y ^ — G _1 (log/c) converges 
almost surely to a finite random variable (Theorem 3.2). This is an extension 
of the result in [5] that — log/c converges a.s. to a Gumbel distribution 
when observations are generated from an exponential distribution. Theorem 
3.1, which is simpler than Theorem 3.2, considers only the Q a class of dis- 
tributions. This theorem standardizes Y ^ by dividing it by a function of k. 
Theorem 3.2, however, provides a stronger result, which for the Q a class of 
distributions is applicable when a > 1. 

The following theorem requires a slight strengthening of condition (2.9). 




(3.4) 
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Theorem 3.1. If the underlying distribution function is in Q a , where 
a > 0, and 

(3.5) lim h'(x) /Z" 6 " 1 = for some e > 

x— >oo 

and 

lim h"{x)/x a - 2 = 0, 
x— >oo 

then 

, \ Yk , Y k Y k 

(3.6) lim — — = lim — — = lim rr 1 — = 1 a.s. 

k-^oo (logk) 1 / k-tooG- 1 (log k) k^oo if- 1 (log A;) 

The proof considers Sfc = (^4fc — l) 2 where ^4^ = n og fcU/a • Theorem 2.1 is 
used to show that S k converges almost surely. We do not believe that the 
strengthening of condition (2.9) by (3.5) is necessary for the conclusion to 
hold, though we use it in the proof. We know from Theorem 3.2 that it is 
not needed for a > 1. 

The second result of this subsection, Theorem 3.2, is the stronger state- 
ment that Y k — G~ (log A;) converges a.s. to a finite random variable as 
k — > oo. The conditions for this result are different from those of Theorem 
3.1, but distributions in Q a with ct ^> 1 satisfy the conditions of Theorem 3.2 
without the additional conditions on h made in Theorem 3.1. 



Theorem 3.2. Let (3 = 1 and F be a VM distribution. Then under con- 
ditions: 

(A) EZ 2 (a) < a 7 for some < 7 < 00 and all a > oq and 

(B) /'(a) < for all a > ao, for some ao < 00 

Y k — G~ 1 (logfc) converges a.s. to a finite random variable as k— > 00. 

The core of the proof is to show that \Y k — G~ 1 (log/c)] 2 converges almost 
surely, using Theorem 2.1. 

Conditions (A) and (B) are usually satisfied for F a VM distribution when 
G(x) increases fast enough. In particular they hold for F G Q a with a > 1. 
That condition (A) holds (for all a > 0) follows from Lemma 3.1. Condition 

(B) holds since here f(x) = l/G'{x) = (1 + o{l)){ax a ~ l [1 + ^^r]}" 1 , so 
from (2.9) f{x) is eventually decreasing. The case a = 1 holds when h{x) 
is increasing. If F has increasing failure rate (IFR), that is, satisfies "new 
better than used," then condition (B) is satisfied. 

Remark 3.1. If xf < 00, it is easy to see that lim/^oo [Y^ — G~ 1 (log k)] = 
a.s. An example F of a VM distribution with xp < 00 is 1 — F(x) = 
e 1 / x I(x<0). 
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Corollary 3.1. Let F ~ Q a with a > 1 and h(x) = 0. Then 

Y k — log 1 /" k converges a.s. to a finite random variable as k—^oo. 

Remark 3.2. The conclusion of Theorem 3.2 does not hold for all F € 
Q a , a > 0, thus not for all VM, for example, H(x) = x 1 / 2 . We omit the proof. 

3.2. Results on convergence of moments. 

Theorem 3.3. // conditions (A) and (B) given in Theorem 3.2 hold 
then there exist constants < &i, 62, &3 < 00 such that 

[EY k - G" 1 (log k)]^h, 
E[Y k -G- 1 (log k)\ 2 ^b 2 

and hence 

Vav[Y k -G- 1 (log k)]^b 3 . 

4. Average, when /3 > 1. In this section we consider the behavior of Y k 
under the more stringent condition that an observation is retained only if it 
exceeds (3 times the previous average, where /3 > 1. The main result is that 
Y k must be standardized by an entirely different quantity, namely, k 13 ^ 1 , in 
order to get a.s. convergence. For F £Q a this standardization is correct for 
all a > 0. The result depends on the following relationship: 

,4,, n=n _ 1+ <^i)Itl + ^Itl). 



The result concerns B k = -X^ . Let T k denote the a-field generated by 
Y\, . . . ,Y k . It follows by dividing both sides of (4.1) by k^~ l that 



F k 



k-l 



E(B k \J r k-\) = (l + O (^) ) + E 
(4-2) ^ ^ " 

= B k ^ (l + O U\ ) + f(pY k -i)/kP . 

Hence if the expected overshoot is bounded, it follows from Theorem 2.1 
that B k converges almost surely. A more refined result appears in the next 
subsection followed by remarks on special cases. The section ends with re- 
sults showing that under some conditions the expected value and variance 
of B k also converge. 
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4.1. Almost sure convergence of the mean. We first show that Bk con- 
verges almost surely under more general conditions in the following: 

Theorem 4.1. Assume F is a VM distribution. Let Bk = and 
f{x) = E(X-x\X>x). 

(i) If f(x) < (log^i+e > where c> and e > 0, then Bk converges a.s. to 
a nondegenerate positive random variable. 

(ii) If Bk converges a.s., f is monotone and lim^oo E{Bk) < oo then 
for some constant xq > 0, 

(4.3) f°° fM-dx <oo. 

Jxo x 

Remark 4.1. (a) Note that the sufficient condition (i) of Theorem 4.1 
can hold also for distributions that are not VM. An example is the Geometric 
distribution. 

(b) Equation (4.3) does not have a /3 in the expression. Also, under the 
more restrictive condition of bounded expected overshoot that was used to 
introduce this section (which led to an easy proof of almost sure convergence 
of the desired quantity), (4.3) holds. 

The following is a general statement about convergence of Yk for the 
stretched exponential family of distributions. 

Corollary 4.1. Let F E Q a , a > 0, /3 > 1. Then there exists a random 
variable < Wp < oo such that 

__ _^ W p a . s . 

Proof. S ince x i—c}/ a — ^ 1 it follows that /(x) < cx^ a for some constant 
c> for all x > xq for suitable choice of xq. Hence the condition in (i) of 
Theorem 4.1 holds. □ 

There exist VM distributions for which Bk fails to converge a.s. to a finite 
limit. The proposition below provides a general result for when Bk does not 
converge to a finite limit a.s. 

Proposition 4.1. Let ^(a) be an increasing positive function of a such 
that 

(4.4) r^i dx=O0 _ 
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Let B k = -p^r, Z(a) ~X- a\X > a and define Z*(a) = Z(a)/^(a). If there 
exists a constant a$ and a nonnegative random variable V , not identically 
zero, such that for all a > ao V is stochastically smaller than Z*(a), then 
Bk — > oo a.s. as k — >■ oo. 

Example 4.1. Let 1 — Fx{x) = e~^ ogx ^^ 2 , which is easily seen to be a 
VM distribution. Let ^*{a) = a/log(a). We shall show that the conditions 
(and hence the conclusions) of Proposition 4.1 hold for this example. 

Proof of Proposition 4.1. It is immediate that f °° = oo. 
Furthermore, 

1 - Fx{a + xa/\oga) _ exp{-(logq + log(l + x/loga)) 2 /2} 
Z *^ [X) - 1-F x (a) ~ exp{-(loga)V2} 

1' ' 



,M -(loga) (bg^l + ^ ) ) ~ -(lo S [l + — 
> exp-| —x — X - „ X > exp{— x — x 2 /2} 



(loga) 

for all a > e. Hence, if V is such that 1 — Fy(x) = e~ x ~ x2 ^ 2 l(x > 0) then V 
is stochastically smaller than Z*(a) for all a > e. Furthermore, 

SoZ)p*(^ 1 ) = X^Pl og ( 7 A;/3-l) 
fc=l fe=l &V ' ' 

n 

= 7 ^[A;(log7 + 00-1) log A:)]" 1 
fc=l 

= oo. □ 



Note that since here f{x) = [1 + o(l)]x/logx, it follows that one cannot 
take e = in Theorem 4.1(i). 

4.2. Convergence of moments. We now turn to showing that the expec- 
tation of Yk suitably normalized converges to a finite limit for all random 
variables that belong to the stretch exponential. 

We first consider EB^ and Var B^ in a general setting. 

Theorem 4.2. Under the following three conditions: 

(a) Var X < oo; 

(b) f(a) is nonincreasing for a > ao; 

(c) EZ 2 (a) < ca for some c > and a> a$; 
EBh and Var B^ converge to a finite limit. 
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Remark 4.2. Condition (a) always holds for nonnegative X with F a 
VM distribution (see Exercise 1.1.1(a) of [7]). Lemma 3.1 implies that (c) 
holds for any F £Q a with a > 1/2. Condition (b) holds for all F £ Q a , a > 1, 
as well as for X ~ Exp(l). 

The above theorem does not apply for F £Q a , with a < 1. Nevertheless, 
EB k converges in this shown in the following theorem. 

Theorem 4.3. Let B k = j&r- If F eQ a , a> and (3 > 1, then EB k 
converges to a finite limit. 

Proof. By Theorem 4.2 we need only consider the case a < 1. From 
Lemma 3.1 it follows that for some c and k large enough 

/03F fc _!) < cTXlX = C[(k - iy , - 1 B fc _ 1 ] 1 - a = C(fc - 

Substituting this into (4.2) yields 



(4.5) E(B k \F k ^) < 



1 + 



1 



£.min(2,l+(/3-l)a) 



1 



jfcl+08-l)o 



Taking expectations on both sides of (4.5) and using Corollary 2.1 yields the 
result. □ 

5. Time until k items are kept. 

5.1. Discussion of the problem. In this section we turn to the second 
quantity of interest, T k , the number of items that are observed until k items 
are retained. Unfortunately, it is generally impossible to normalize T k by 
a function of k and achieve almost sure convergence to a nondegenerate 
random variable. Instead we consider the following quantity: 

(5-1) T* k = , 

which depends on the averages {Yj}, the expectation of which tends to 1. 

The results are obtained for the Q a , a > class of distributions. One in- 
teresting facet of the results for a > 1 is that the nature of the convergence 
depends on (3. When j3 is relatively small, 1 < ft < 1 + ^— , then the conver- 
gence is almost sure to 1. When /3 is moderate in size, 1 + -^<(3<1 + ^, 
the convergence is to 1, in probability. Finally, if /3 is large, /3 > 1 + — , the 
convergence is in distribution to an exponential or a sum of conditionally 
independent exponential random variables with means summing up to 1. 
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Theorem 5.1. Let (3 = 1 and Xi ~ F where F is Q a , a > 0. Then 



T£ = ^ k _ 1 — — ; — > 1 almost surely. 



The proof uses Theorem 2.2 by conditioning on the responses {Ifc}, letting 
Pj = l- FiYj.,), bj = ELi Pf 1 and Q l = T i - Tj_i - Pr 1 with T = 0. 

Though Theorem 5.1 gives no explicit order of magnitude of the conver- 
gence of Tfc, in terms of A:, we get an idea of this magnitude in the following: 



Corollary 5.1. For any 5 > and F G G a , a > 0, /3 = 1 
limTfc/fc 2-5 = oo and lim / A; 2+<5 = a.s. 

rp 

For the exponential distribution converges a.s. to a limit as shown in 
[4]- 

5.3. Asymptotic results when ft > 1. The focus is on T£, the number of 
observations that are observed until k items are retained suitably normalized 
as defined in (5.1). 

For the sake of clarity, we consider in the continuation only F G Q a , a > 0, 
where h(x) = 0, that is, H(x) = x a . 

Theorem 5.2. Let X ~ F where 1 - F(x) = e~ xa and a > 0. Then as 
k — > oo: 

(i) T *^lforl<(3<l + ±, 

(ii) T* k Al/orl + ^</3<l + i, 

(iii) r * A Exp(l) and Jt- A Exp(l) /or /3 > 1 + i. 

The result for /3 = 1 + — is of a different nature, and hence is treated 
separately in Theorem 5.3. To prove parts (ii) and (iii) of Theorem 5.2, we 
compute the limiting generating function of Tj., suitably standardized, and 
are able to recognize the distribution for which this limit is the generating 
function. The results then follow from the Continuity theorem. This line of 
reasoning is also used in proving Theorem 5.3. 

Note that for J7~ Geo(p), 

1 



Ee 



-tu 



1 + (1 - e-*)/(pe-*) ' 
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We ignore the first observation which adds one to T k (this will have no effect 
on the limiting distribution) . Hence the resulting random part of X& (which 
we refer to as T k ) is the sum of, conditionally on {Yj} c ?L 1 , independent 

geometric random variables where pj = e~(P Y i- 1 ' . We have conditionally 
on {Yj} 



J=2 



1 _ e -t~f(k) 

1+ P -(^y J -_i)°-t7(fc) 



where the sequence j(k) is positive, and will be defined as a function of the 
given {l^fe}, according to the need in the proof for each particular instance, 
but always tends to 0. Thus 

k / 1 o-*7(fe) — \ 

logEe"***^ = - J>g( 1 + ~ tl{k) t^e^+^^n 

(5.2) 

k 

= -J2 log[l + (1 + o k (l))tj(k)e^ Y ^ a }. 

3=1 

For (ii) we let 7 (jfe) = 1/Ej=i e(/?Fj)Q and for ( m ) we let l( k ) = e~ WYk ~ l)a ■ 
We now turn to the case where f3 = 1 + 1/a, so that (3 — 1 = 1/a. This 
is the only case where conditioning on the sequence {Y k } plays a role in 
the limiting distribution obtained. We know from Theorem 4.1 that there 
exists a random variable W, < W < oo, such that Y^/k 1 ^ k ^Sf \y a s 
Our result will be stated in terms of the value of W. 



Theorem 5.3. Let X ~ F where 1 — F(x) = e~ xa , a > and /3 = 1 + 1. 
Let W = Y\mY k /k l / a . Then 



oo 

D 



TJ* — > ^2 Rj as k — > oo, 
i=i 

where, conditionally onW = w, the Rj are independently, exponentially dis- 
tributed with mean fij, where 

_ exp((3w) a - 1 
H ~ exp\j(Pw) a ] ' 



Note that the fij sum to 1. 
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6. Concluding remarks. The present paper extends the results in [4] 
where the Exponential, Beta and Pareto distributions are considered in de- 
tail, to other distributions that include the Normal, Gamma and Weibull. 
The results on the special distributions considered in [4] are "invertible" in 
the sense that rates of convergence for Y and T/% imply rates of convergence 
for the number of items that are kept and the average of the items kept after 
n items are observed. The results obtained for the distributions considered 
here are in general not invertible in this way. 

Preater in [5] considered the behavior of the average of the first k items 
that are kept, Y^, when the distribution generating the observations is ex- 
ponential and j3 = 1 in the {3 better than average rule. He observed that 
Yfc — log k converges a.s. and in L2 to a Gumbel distribution. The behavior 
of this quantity for f3 > 1 is markedly different. When (3 = 1, Y \ — log A; con- 
verges a.s. When $ > 1, Yfc/fc^" 1 converges a.s. In addition, the rate when 
/3 > 1 holds for many distributions, while the amount that one subtracts 
from Yfc when f3 = 1 depends on the distribution. 

There are two interesting mathematical observations. First, it is not sur- 
prising that there should be some relationship between the domain of attrac- 
tion to which the extremal distribution of F belongs and the limiting distri- 
bution of Yfc, since the Y^ process will, on the average, select larger and larger 
items. Preater in [5] shows that Y^ — log k and max{Xi , . . . , Xk} — log k have 
the exact same limiting Gumbel distribution when the observations are i.i.d. 
from an exponential distribution (though Y^ converges a.s. and in L2 while 
the maximum converges only in distribution). Will the limiting distribution 
of Yfc, and = max{Xi, . . . ,Xk} always agree, or at least have the same 
rate of convergence? From the general theory of extreme values it follows 
that 

JTETTn ( M k ~ H~ l k )) ^ U = Gumbel as k -> 00. 

f(H ^logfe)) 

This should be compared with our result for /3 = 1 (under the appropriate 
conditions of Theorem 3.2), 

(6.2) Yfc — G -1 (log k) some finite random variable as k — > 00. 

The "normalization" is the same in (6.1) and (6.2) if and only if f(x) = 1, 
that is, if and only if the tail of the distribution of X is exponential. 

The second interesting mathematical observation is that for the Beta and 
Pareto distributions, discussed in [4], we get the same kind of a.s. conver- 
gence for Tfc, after normalization (depending on (3) for all (3 > 1. In the 
families of distributions considered in the present paper, different kinds of 
asymptotic convergence hold for different values of (3. Specifically, when /? 
is relatively small, the normalized quantity converges almost surely. When 
P is in the middle range, the convergence is in probability. For large values 
of (3 the convergence is in distribution. 
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APPENDIX 

A.l. Proofs for Section 3. 



Proof of Lemma 3.1. Consider (3.3). For any nonnegative random 
variable Q, EQ 2 = 2 y(l - F Q (y)) dy. Thus 



2f°°{x-a)e- H ^dx 



EZ\a) 



e 



-H(a) 



So 



, EZ 2 (a) r(x-a)e-^ a+h ^dx 

a->oo 2a S a->oo Q S e ~{a a +h(a)) 



(A.l) 



a— >oo 



lim 



e-^ a +hH){S a s~i - [aa*- 1 + h'{a)]a 5 } 
r oc ( x « +h{x) ) dx 

J a 



a^oo e -(a«+h(a)) aa a+S-l[l + h' {a) / {aa a ~ V )} 

roc -( x "+ h {x)) dx 
= lim — 

a-^oo e -O a +M a ))aa a+<5_1 

by (2.9). Using l'Hopital's rule once more we get that the value in (A.l) 
equals 

_ e -(a a +h(a)) 

lim 

a^oo ae -( aa + h ( a )){(a + 5- l)a a + s ~ 2 - [aa a ~ l + /i'(a)]a a +< 5 - 1 } 

= lim 



Thus if we take 5 = 2(1 — a) the above limit is 1/a 2 and (3.3) follows. 
The proof for E(Z(a)) = J°° e~ H ^ dx/e~~ H ^ follows in a similar manner. 

□ 

Proof of Lemma 3.2. Through integration by parts 

[°°e- H Wdx 

f(a) = is — 

J v ' e — H{a) 

f^e- H ^H'(x)/H'(x)dx 

[-e- H W/H'(x)\? / a °° e- H ^H"(x)/(H'(x)) 2 dx 

~ e -H(a) ~ e -H(a) 

Note that H" \x) / \H' \x)) 2 tends to for a VM distribution. Now to get the 
rate, consider H(x) =x a + h(x) where liuL r _ s . 0O h'(x)/x a ~ 1 = by (2.9), and 
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assume lirn^oo h"{x)/x a 2 = 0. This implies that r^TT^p = Since 
the first term is 1/H'(a), using l'Hdpital's rule on the second term yields 

Finally, to get the rate at which f(a)/(a 1 ~ a /a) goes to 1, we need the rate 
at which h'(x)/x a ~ 1 goes to 0. If we assume that /i'(x)/(x a_e_1 ) goes to 0, 
where < e < a, we have (3.4). □ 



Proof of Theorem 3.1. Let A 



that ^Ofc_i) 

logfc 



k = (logfc'jV" and S k = ( A k ~ 1) ■ Note 



1 "fciil + (^)' thus 



log(fc ~ 1) 
log A; 



l/a 



1 



ak log k 



Hence, 



log(fc ~ 1) 
logfc 

(A k _ x - 1) - A fc _x 
-l) 2 + ii|_i 



l/a 



+ 



fc(]ogfc) 1/c 



1 



afc log 
1 



fc 2 // ^(logfc) 1 /" 
1 



+ 



a 2 k 2 (\ogk) 2 \k 3 J J k 2 (logk) 2 / 



+ 



+ 2(A fe _! - 1) 



.4 



^(logA:) 1 /" 
/ 1 



1 



ak log A; 



"fcQogfc) 1 / 4 *"* ^afclogfc ' "V^ 2 , 
Taking conditional expectations on both sides, using (2.1), we therefore get 

E(S k \F k -i) 



01 i 



< + 2(S fc _! + 1) 



(A.2) 



ak 2 (log k) 2 







+ 



E{Z 2 {Y k _ 1 )\F k ^ l ) 



k 2 {\ogk) 2 / c 



+ 2(A fc _i-l) 



fc(logfc) 1 A 



1 



afc log A; 



+ 



A; 2 
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since A 2 _ 1 < 2(S k —i + 1)- The first two terms in (A. 2) therefore cause no 
problem in the application of Theorem 2.1 to S k . By Lemma 3.1, for all k 
sufficiently large 

E(Z 2 (Y k _ 1 )\T k - 1 ) 2(1 + e) Yf±i a) 



< 



k 2 (logk) 2 / a a 2 k 2 (\ogk) 2 / a 

_ 2(1 +£ ) A^rWfc - i) i ' a ) 2{i - a) 

a 2 k 2 (logk) 2 l a 

c 2(l+ £ ) A 2 ^ ^(l + e)(A 2 _ 1 + l) 
a 2 k 2 (logk) 2 a 2 k 2 (logk) 2 



< 2(l + e) 



a 2 



2Sk-i 3 



k 2 {\ogk) 2 k 2 (logk)' 



so the second term in the last expression is summable, and again factoring 
out Sk-i the first term is also summable. 

It remains to deal with the last term in (A. 2). Prom (3.4), 

F^[l + o(l/F|_ 1 )] 



f(Y k . 



lj 



a 

Thus the first term in the square brackets in (A. 2) satisfies 

/gyp _ n:?[i + jVn-i)j 

fclogfc 1 /" a fc (log A;) 1 /" 

_ ^I?[(log(fc - 1))V^[1 + o(l/(A fc _ 1 (log(fc - l)) 1 /") 6 )] 

a&^logA;) 1 /" 

_ ^[l + o^f^qogfcr)] 

afc log k 

where u = e/a. The last line in (A. 2) can therefore be rewritten as 

( 2(^x - 1) Afc-i [i - + ggog fc/fc) + ^>(^f j/qog fcr )] 

afe log k 

We want to study when (A. 3) is positive for large k. This depends on the 
term in brackets, which to simplify notation we denote by R(x), where x = 
A k -i, and the dependence of R(x) on k is implicit. Note that for k sufficiently 
large O (log k/k) <5 k = l/(log k) u . Also note that v k = A^l x 6 k < Y k -\ ~^ 
as k — > oo and that u k < XQ £ 5 k if A k _\ > xo > 0. Hence when is sufficiently 
large, R(x) < R(x) < R(x) where 

(A.4) R(x) = l-x~ a - S k - u k x~ a 
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and 

(A.5) R(x) = 1 - x~ a + 5 k + v k x~ a . 

The aim is to show that (A. 3) is positive only when 1 — c5 k < x < 1 + c5 k 
for a suitably chosen constant < c < oo. We consider two cases: 

(i) Assume x > 1. Then (A. 3) is positive for the values of x such that 
R(x) < 0. Since R(x) < R(x), the values of x such that R(x) < 0, or, equiv- 
alently, the values of x such that x a R(x) < include the values of x such 
that R(x) < 0. It suffices to consider 

(A.6) x a - 1 - 5 k x a - u k < 0. 

The set of x such that (A.6) holds is equivalent to the set of x such that 

for k large for a suitably chosen constant < C\ < oo. 

(ii) Assume x < 1. Then (A. 3) is positive for the values of x such that 
R(x) > 0. Since R(x) > R(x), the values of x such that R(x) > 0, or, equiv- 
alently, the values of x such that x a R(x) > include the values of x such 
that R(x) > 0. Hence, we want to consider when 

(A.7) x a + 5 k x a + v k >l. 

Since 5 k and v k are arbitrarily small for k sufficiently large, there exists 
xo > such that for (A.7) to hold it is sufficient that x > xq. Therefore, 
(A.7) is equivalent to 

1 - Vk x 



(A.8) x> {TTi) >1 ' C2dk 

for k sufficiently large for a suitable chosen constant < C2 < oo. 

The above analysis shows that (A. 3) can be bounded from above by zero 
when A k _i is outside the interval c± 5 k . When it is inside, (A. 3) is bounded 
by 0(l/[fc(log &)"]). Hence (A. 3) is summable. Thus converges a.s. by 
Theorem 2.1. If S k converges to a value different from this would lead to 
a contradiction, as the sum of the terms in (A. 3) would go to minus infinity, 
while S k is nonnegative. Hence A k tends to 1 a.s. 

Note that when H(x) = x a + h(x), and (2.8) holds, then necessarily H^ix) 

x l l a + h*(x) where h Sf] x ^Sf o. Thus ^ — rrn^ fc Jl2P \ an( j the right-hand 

side of (3.6) follows. Since G~ 1 (x) = H~ 1 (x)[l + o(x)], also the middle term 
in (3.6) converges a.s. to 1. □ 
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Proof of Theorem 3.2. The first step in the proof is to show that 
(Y k — G~ 1 (logk)) 2 converges a.s. to a finite random variable as k — > oo. 

Since Y \ — > oo, there will be a (possibly random) ko such that for all 
k > ko, everything written below holds. Consider k > ko only. Let c k = 
G~ 1 (logk). Then, by (2.7) and the boundedness of /, 

(logk-log(k-l))[G-\u k )]' 



(A. 



log 1 



f(G-Hu k )) 
k 



where the 0(-^) term is positive and 

(A.10) log(fc-l)<u fc <log/c. 

Note that the last equality in (A. 9) follows since / is bounded by condition 
(B). Now write 



(Y k - c k f 



(y fe _i-c fc _i) + 



k 



-i 2 



+ (Cfe-l - Cfc) 



(y fc _i-c fc _i) H h(c fc _i-c fc ) 



2Z(Y 



k-lj 



+ 2{Y k _ 1 -c k _ 1 



A- 2 

c fc _i - Cfc) 



k 



+ (Cfc_i - c k ) 



Taking conditional expectation, conditional on J- k -\, yields 

s[z 2 (Ffc_ 1 )|j- fe _ 1 ] 



E[(Y k - c k f\ Jfc_i] = (Yfc_! - C fc_i) 2 + 

* v 

(1) 



k 2 



(ii) 



(A.ll) 



2/(Y"fc-ij 

+ (Cfc_l-Cfc) H ; (Cfc_l-CfcJ 




We shall show that the conditions for Theorem 2.1 hold. We shall examine 
each term in (A.ll) separately. We first show that for any uj > 

(A.12) F fe /fc w -»-0 as/c-^ooa.s. 
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Let W k (oj) = |£. Then clearly W fc (u;) > and 



since / is bounded (where we have denoted its bound by B). It follows 
that W k (uS) converges a.s. to a finite limit, L{u) > 0. Then also W k (uj/2) — > 
L(cj/2) a.s. But W k (uj) = W k {uj /2) jk^l 2 , thus the limit must be for all u. 

Now consider term (ii) of (A.ll). By condition (A) and (A. 12), for all k 
sufficiently large 

x E\Z 2 (Y k _ 1 )\J r k ,] Yl i ek^ 

(A.13) 1 ^ k k 2 lJ < < IfeT- a - s - 

Choose cj < - and write 1 — c<j7 = 5. The rightmost expression in (A.13) is 
then e/k 1+s , which clearly is summable. 

Term (iii) is summable by (A. 9) and the boundedness of /. 

Term (iv) is negative, and hence causes no problem. 

Term (v): note first that by (A. 9) 

f(Xk-i) , ,„ s f(Y k ^)-f{G-\u k )) (1 

(A.14) 



k J v KJ 1 v^ 2 

where d k is a value between Y k _\ and G _1 (ufc). Since G _1 is increasing, it 
follows from (A. 10) that 

(A.15) c fc _i <G _1 (u fc ) <c fc . 

Consider two cases: 

(a) F fc _i - c fc _i < 0. Then by (A.15) also F fc _i - G^(u k ) < and by 
condition (B) (v) is negative since the 0(1/ k 2 ) term is positive. 

(b) Y k _i — Cfc_i > 0. If also Y k _i — G _1 (itfc) > 0, the previous argument 
goes through, except that we still must show that (^fc-i — c k _\)I(Y k _i — 
c k _\ > 0)/k 2 is summable. Now write (Y k _i — c k _\)I(Y k _\ — c k _\ > 0) < 

(F fe _! - Cfe _!) 2 + 1. Thus 

(A.16) (F fc _! - c fc _i)/(F fc _i - c fe _! > 0)/fc 2 < (yfc " 1 ~ 2 Cfc " l)2 + 1. 

The first term on the right-hand side of (A.16) can be combined with (i) in 
(A.ll), and the second is clearly summable. 

Now suppose Y k -i — G~ 1 (ufc) < < Y k _i — c k -\. Then 

(A.17) c fe _! <Y k _ x <G~\u k ) <c k . 
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Since both \Y k — Ck-i\ and \Y k — G~ l {uk)\ are less than c k — c k -i, it follows 
from (A. 9) that (v) is summable. 

It follows that in all cases we can write E[(Y k — c k ) 2 \J- k -i] < (Xk-i ~ 
c fc _i) 2 (l + B k -\) + D k -x - Vfc_i, where B k , D k and V k are nonnegative 
random variables, and B k and D k are summable. Thus by Theorem 2.1 

(A.18) (Y k -c k ) 2k ^W a.s., 

where < W < oo is a random variable. Thus \Y k — c k \ — > k ^oo VW a.s. 

It remains to show that when W 0, Y k — c k cannot jump between y/W 
and —y/W an infinite number of times. It will then follow that the limit 
exists and is either y/W or —y/W . Recall that Y k — Y^-\ = ^r, and that by 
(A. 9) < c k — c k -\ < j, for some 7 > 0. 

Take expectations on both sides of the inequality in (A. 13). Then 

P { F i -F fc _ 1 > £} = p{f>4=p{|>^}< i ^. 

Thus by the Borel-Cantelli lemma P{Y k — Y k ^\ > e infinitely often} = 0. 
This implies P{\ (Y k — c k ) — (Y k _i — c^— 1) | > 2e infinitely often} = 0. Thus, if 
\/W > e,Y k — c k cannot jump between \/W and — VW an infinite number of 
times, that is, Y — c k will converge a.s. to y/W or — y/W. Since for W > 0, 
there always exists a small enough e > such that W — e > 0, it follows 
that Y k — c k converges. Clearly on the set where {W = 0} the statement 
(Yk — c k ) 2 — > is equivalent to Y k — c k — > 0. □ 



Proof of Corollary 3.1. The expected overshoot given X > a is 
Ho) - 1Z^L± - l igj^f^ 

J \ U ) ~ — n a — _ n a 

e a a e a 

The right-hand side follows by change of variables to y = x a . But in Abramowitz 
and Stegun [1], page 263, 

TOO 



Jx 



This implies that 



x 



u-l 



1 + 



v-l 



(4 



f(a) = -a 
a 



l-a 



1 

71a 



as x — > 00 . 



as a — > 00 . 



Equation (2.6) implies 
1 



G'(a) 



aa 



a-l 



1 

12a 



as a — > 00. 



Integrating both sides results in G(a) = a a + (a — 1) log a + O(l) for large a 
since the remainder term 0(a~( Q+1 )) has finite integral. 
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Now for any e > 0, if a is sufficiently large, then G(a 1//a +e) = (a 1 /" + e) a + 
(a-l)log(a 1 / a + e) + 0(l). But (a^ + e)" = a{l + = a + aea 1 - 1 /^ 

smaller order terms. Hence if a > 1 and a is sufficiently large, then G(a 1 ^ a + 
e) > a. A similar argument shows that if a > 1 and a is sufficiently large, 
then G{a l / a -e)<a. Therefore Iim a ^ 0O [G _1 (a) - a l / a ] =0. □ 

PROOF of Theorem 3.3. Upon taking expectations on both sides of 
(A. 11) we obtain 

'E[Z 2 {Y k ^)\F k ^\ 



E[(Y k - c k y] = £?[(r fc _i - c fc _i) 2 ] + E 



CO 



A- 2 



(") 



(A.19) 



2 + 2^ 



(iii) 



+ 2£f(Y fe _ 1 -c fe _ 1 ) 



(iv) 



+ (Cfc-1 - Cfc) 



(v) 

All we need to do is subtract — c^-i) 2 ] on both sides and sum. If a 

term remaining on the right-hand side is positive then we need to show that 
it is summable. If a term is negative it must be summable as the term on the 
left-hand side is nonnegative. Hence we see that terms (ii), (iii) and (iv) cause 
no trouble. The only term of concern is (v). But the expectation (integral 
over the density of Y k ) can be divided into an integral over three regions: 
(i) Y k ^i < c fe _i, (ii) F fc _i > G~ l (u k ) and (iii) c k -i <Y k ^ < G^(u k ). As 
in the proof of Theorem 3.2, the integrand for regions (i) and (ii) is negative 
and over the third region it is positive, but can be dealt with in the same 
way as in the proof of Theorem 3.2 by use of Corollary 2.1. The last two 
statements of the theorem follow. □ 

A.2. Proofs for Section 4. 



fc-1 



P-l 



Proof of Theorem 4.1. 
Proof of (i). 

(A.20) E{B k \F k _ 1 ) = B k _ x \\ 
Thus 

(A.21) WW = B» (l + O (i) ) + w^nA . 



f(0Y k . 
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where O(^) > 0. Thus E(B k \T k -i) > + 0(5?)) which implies that 

B k converges, to a finite or infinite limit. 

Suppose first that the limit is infinite. Then there exist k$ and D > 1//3 
such that for all k > ko, B^-i > D. But then from (i), for k > ko 

/(^(fc-l^Bfc-i) ^ R 



^ ^ [log(/3S fc _i) + (/9 - 1) log(* - 

c/3 



fc[log(/3D) + 08-1) log(fc - l)] 1+£ 

c/3 

((3 - l) 1 +efc[log(A: - 1)] 



l+e ' 



But the term multiplying -Bfc_i on the right is summable, which implies that 
(A.21) satisfies the condition of Theorem 2.1, and hence B k converges to a 
finite limit. This contradiction implies that B k converges to a finite r.v. a.s. 

Proof of (ii). Now suppose that B k converges a.s. and limEB k < oo. It 
follows from (4.1) and as in (A. 20) that B k can be written as 



Bi, = Bi, 



k-l 



where 0(l/k 2 ) is positive. It follows that B k > B k _i, so that the limit is 
positive. Since the support of the observations is not bounded, in a different 
realization one could obtain a higher value. Hence the limit is a nondegen- 
erate positive random variable. Set Bq = 0. Then 



(A.22) B k = ±(B, - Bj . l} = 0(1)± *fcl + £ £Cffii=!l. 

j'=i j=i J i=2 J 

Since converges a.s. limi?/,, exists and is finite a.s. Taking expecta- 
tions and limits as k — > oo on both sides of (A.22) and noting that EB k is 

assumed to be bounded implies that X^=l Z ^Jp~^ < 00 • This in turn im- 
plies Yl'jLi ^^ Y ji~^ < 00 • Since i?*,. converges a.s. to a random variable Wp 
for < e < Wp and a (random) &o, we have for all k — 1 > ko, (Wp — e)(k — 
if' 1 <Y k ^ < {Wp + e){k - . If / is increasing 

^ f(0Y k -i) . ^ /(^-^(fe-l)^- 1 ) 

k=ko k=kQ 



(A.23) 



flY^ f(A(k-lf- 1 ) 



2 J ^ (k-l) 
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where the inequality follows since (^r 1 )^ > (§)^> and where we have written 
A = [3(Wp-e). 
Finally, 

4 (fc-i)^ > L-A X +iy f{Ax )dx 



By change of variable to y = Ax@~ l the integral on the right-hand side 
'A(fc -2)' 3 - 1 ~p 



becomes I^{k o - 2 )0-^ y?~dy. This integral is therefore finite by (A. 23). 



□ 



Proof of Proposition 4.1. In a manner similar to the end of the 
proof in Theorem 4.1, it can be shown that if j^—jpr-dy diverges, then 

lim n ^, OQ Y^ =ho ^('jk^~ 1 ) also diverges. 



=fcoF 
Note that 



(A.24) B k = B k ^ 



l + O 



Let F k be the c.d.f. of Z* (f3Y k -i) conditional on Y k -i- Let Fy be the c.d.f. 
of V. Let U h U 2 , . . . ~ U[0, 1] i.i.d. Define V k = Fy\U k ) (so V are i.i.d. with 
c.d.f. Fy)- Clearly, V k < F k ~ l (U k ) conditional on Y k _\ once f3Yk-i > ao 
(which will happen with probability 1). 

It follows that one can imbed the sequence Y±, Y 2 , ■ . ■ in a probability space 
where V±, V 2 , ■ ■ ■ are i.i.d. with c.d.f. Fy and 

Vi < Z*{pY k _i) for all i such that > a 

conditional on Y k _\. Define V* = cl{Vi > c ) f° r some c such that P(Vi > 
c) > 0. Clearly, V* < Z*{f3Y k _i). Note that V* is c times a Bernoulli random 
variable. Now 

(A,5) n^ nBYk _^_ v .^A. 
Recall that 

so that for a constant ai that is independent of Y\ and fc, 

k 

(A.26) n > f[ LlLll > 

i=2 3 



26 A. M. KRIEGER, M. POLLAK AND E. SAMUEL-CAHN 

Hence (for k such that fPY k ^\ > clq and a\ a constant) since ^(a) increases 



m a 



(A . 27) ^.fro^;**' 1 - 1 ''-') 



Finally, condition on Y"i and denote 

Ck = P • 

By (4.4) and what we showed above, linin^oo Ylk=l c fc = °°- It is a straight- 
forward application of Kolmogorov's three-series theorem (cf. Feller [2], page 
317) that 

(A.28) lim V V k c k = oo a.s. 

fc=l 

Putting (A.27) and (A.28) together obtains that ^^^ ^(/3F fc _i) is not 
summable. This and (A. 24) imply that lim^oo U/% = oo a.s. □ 

Proof of Theorem 4.2. EB k converges to a finite limit by (4.2) since 
/ is bounded by assumption (b) 

Var B k = Var —„ Y k _ x + 



K_1 k$ 



(A.29) 



(fc-l + Z?) 2 ^-!) 2 ^- 1 ) 



Var^. + X-^^-)) 



k 20 



+ 2 * k \t^ cov(y fc _i, Z03y fc _i)). 



We shall treat each of the three terms in (A.29) separately. 

(i) It is easily seen (by taking log) that the value in the square bracket 
isl + 0(£). 

(ii) From condition (c) and the convergence of EB k to a finite limit 
Var(Z(/3F fc _!)) < EZ 2 (f3Y k ^) < c^EY k ^ < c/3(limEB k + e) 



k 2p k 2p k 2p k p+i 

Thus the second term in the right-hand side of (A.29) is summable. 

(iii) We now show that the third term in the right-hand side of (A.29) is 
negative or 0: 

= E{Y k ^Z{pY k ^)) - E(y k . 1 )E(Z(0Y k . 1 )) 
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= EFk-iE(z(PYk-i)\rk-i)] - s(F fc _i)£;[£;(z(/9F fc _i)|7fc_i)] 

= ^ElPYk.JipYk^)} - ^E{0Y k _i)E f (pY k ^i) 

= icov( / 9F fc _ 1 ,/( / 9F fc _ 1 ))<0, 

where the last inequality follows from (b). It follows that (A. 29) satisfies the 
condition in Corollary 2.1 with z n = Var B n , and the result follows. □ 

A.3. Proofs for Section 5. 

Proof of Theorem 5.1. Let Pj = 1 — F(Yj-i). We shall use Theorem 
2.2 conditionally on the sequence {^jfc}. Let bj = Yli=i Pf anc ^ Qi = P% ~ 
Tj_i — P" 1 with To = 0. Obviously, the sequence {bj}JL 1 satisfies the first 
condition of Theorem 2.2. 

Note that conditional on the sequence {Pj} the distribution of Tj — Tj_i is 
Geometric (Pi) and these differences are conditionally independent of each 
other. Hence {Qn}%Li is a sequence of conditionally independent random 
variables with zero expectation and variance (1 - P n )/Pn- We shall show 
that the second condition of Theorem 2.2 holds 

oo oo .. j-j / / n \ ^ oo 1 

E^A/ E^ 1 <E^. 

n=l n=l n 1 \j=l / n=l nl 

It therefore suffices to show that for all n > uq 

n 

(A.30) yf^±l> An V^ ogn 

3=0 Pj+1 

for some A > 0. We shall actually show that for any < e < 1/4 there exists 
jo such that for all n> j > jo 

(A.31) r^±i>4-^. 

Pj+l 

From (A.31) it is immediate that (A.30) holds, since 

^ P i+1 ^ P j+X ~ n l+e ^> J ~ n 1+e 

3=0 J j=jo J 3=30 

Note that for H(x) = x a + h(x) for a > and /i that satisfies (2.8), we have 
by Theorem 3.1, Yj = (log j) 1/a (l + £j) with Ej 0. Thus 




H(Y j ) = (\ogj)(l + s j r 



1 + 



(log j)(l + 
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and since h(x)/x a X ^S? o it follows that for any e > there exists jo such 
that for all j > jo 

(l + e)logj>iJ(F i )>(l- e )logj, 

which implies, since [1 — F(Yj)]~ 1 = expH(Yj), that 

(A.32) j 1 +*>[l-F(Y j )]- 1 >j 1 - e . 

Thus (A.31) follows. 

Note that here S n of Theorem 2.2 equals T n — Ya=i > ^ nus b^Sn ~~ ^ 
a.s. is equivalent to T* — 1 — > a.s. Since this result holds for any conditioning 
sequence {Yk}-, it holds unconditionally. □ 

Proof of Corollary 5.1. In (A.32) take any e > 0. Hence for some 
positive constants ci,C2,Cj,c| and all k large enough 

c ^ 2+£ > C2 £ > pi - F^or 1 > ci jy- e > c ^ 2 - £ . 

Since =j — Tfc _ — )■ 1 a.s. for k large enough and c\* a positive constant 



£Li[i-.f(Fj 



fc2_<5 Ej=i[i-^(^j)] _1 fc2 ~ 5 

> c**/c <5 ~ £ ->■ oo a.s. if 5 >e. 
The proof for follows in a similar manner. □ 

Proof of Theorem 5.2. 

Proof of (i). We shall (again) use Theorem 2.2 and show (A. 30) where 
Pj = 1 — F(l3Yj-i). Assume that ko (random) is such that for all k > ko, 
Zf. < r yY k _i. Such a ko exists with probability one by Lemma A.l. Then for 

k > ko, 

7 t =7 t - 1+ Z ' + ( V )n '' ^( 1 + I± ^ 

Thus 

(A.33) Yt < Yt-x (l + 1 + a ^ (* + 1 ) • 

where d = (7 + /? — 1)/3 UQ and p ua (and for later purposes /?;□,) is defined by 
(A.34) 1 + Pi a x<(l+x) a <l + p ua x for0<x<l, 
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where pi a = a and p ua = 2 a — 1 when a > 1 , while = 2 a — 1 and p ua = a 
when a < 1. We have used the inequality (1 + x) a < 1 + (2° — l)x, valid for 
all a > 1 and < x < 1. We can therefore write, using (A. 33), 

(A.35) £±i = e xp{-/3 Q (F^ - F^)} > expj-^F^ 

Now let fci > fco be so large that for all k > k±, Yf. < (W + e)k^~ l , which 
exists, by Theorem 4.1. Then we can continue the inequality in (A.35), by 

Pk+i 
Pk 

To simplify notation let 

(A.36) r = a(/3-l)-l, 



thus t > —1. For j > > ki, we have 

_ TT P fc+i J p V" fcrl^^J 

Pj+i 



P ^>exp £ F >exp{--^ T (n^-( J + ir 1 )}. 



Thus 



But 



thus 



fc=j+i ^ fc=j+i 



P n+1 > ^ -Pn+1 > e -[B/( T +l))nT+ 1 ^ g[B/(r+l)] 
j=l Pj+1 j=k! Pj+1 j=ki 



n en 



it 



(A.37) Y,T ±1> I e^ +1 )l* T+1 dx/e 



[B/(T+l)]x-+\,,[B/(T+l)]n^ 



j+l Jki+1 



We would like the right-hand side of (A.37), divided by n 1//2+£ for some 
(small) e > 0, to tend to a nonzero limit in order for (A. 30) to hold. Thus 
consider, by use of l'Hopital's rule, the limit as y — > oo of 

Ik 1+ i eAxT+1 dx 



\{y) = — 1 j, 47|T+1 where A > is any constant, 



y 6 e AyT 



lim q(y) = lim 



e 



y^i, * ^> y^O e Ay T +l (jjy&-l + A(T + l)y T+<5 ) ' 

which is finite when r + 5 = and tends to oo when r + 5 < 0. Now for 
6 = 1/2, by (A.36) we get a finite limit when a((3 — 1) — 1 + 1/2 = 0, that is, 
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ft = 1 + l/(2a). Thus for /3 < 1 + l/(2a) there will exist an e > such that 
the value of Y^tl §77 > n 1 / 2 ^, and the result (i) follows. 
Proof of (ii). Let 

(A-38) 7(*) = 

in (5.2). Then clearly j(k) — )• 0. We shall show later that [(1 + o k (l))t^{k) x 
eC^^i-i)"] of (5.2) is arbitrarily close to for 2 < j < k for all sufficiently 
large k and f3 < 1 + 1/ a. It suffices to show this for j = k. We can then write, 
using (5.2) and (A. 38), 

k 

-(1 + e )t = -(1 + e) J2Hk)e^ Yj - l)a < log Ee~^ k)fk 
i=2 
fe 

< -(1 - e)J2H k ) e(PYj ' ir = -(! " 
i=2 

It follows that linifc^oo E(e~ t " / ^ Tk ) = e~', which is the desired result. We 
still must show that [(1 + Ok(l))t , y(k)e^ Y ^-i) Q ] of (5.2) is arbitrarily close 
to for j = k for all sufficiently large k and /3 < 1 + 1/a. Let p\ a be defined 
by (A.34) 

i i-i > : > : 



3 J 



for all j sufficiently large, where by Theorem 4.1 lim = W > 0. Thus 



1 

*-~>3=30 

for suitable large jo, as long as (/? — l)a — 1<0, that is, (3 < 1 + 1/a [where 
we have let D = p ia p a (P - l)W a {l - e)L_ 

Proof of (iii). Here let j(k) = e - ^^- 1 . With this j(k), (5.2) becomes 

logii;e-'^ = -^log[l + (l + 0fc (l))te- /3a ^- 1 - F ?-)] 
i=2 

(A.39) =-log[l + (l + o fc (l))t] 

- X>g[l + (1 + 0k (l))te-^=^- ¥ "-^}. 

J'=2 
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Now for some D > (dependent on {l^}) 

< e p 1 i i- 1 ' < e i < e 

Thus 

where v = a(j3 - 1) - 1 > 0, that is, j3 > 1 + I /a. But 

fc-i f& Dx 1 ^ 1 



lim \ ( 

k— »oo ^ — ' 



-D/(i,+ l)[(fc-l)-+ 1 -i-+ 1 ] 



; 2 v 



i=2 



d.r 



l'Hopital 



lim 



D{v + l)k v 

Since the sum in the right-hand side of (A. 39) tends to and 

lim £(e"' 7(fc)Tfc )^ l/(l+i), 



fc— >oo 



which is Ee where Q ~ Exp(l), gj^ a — tends in distribution to an expo- 



nential distribution. The above proof shows that JZjLi e^ Y ^ a /e^ Yk -^ 
1, thus also T£ -A Exp(l). □ 

Lemma A.l. Let X ~ F where F is Q a with a > 0. £e£ (3 > 1 and /et 
Zfc 6e £/ie random "overshoot" over (3Y ' k -\. For any 7 > 0, and any < 5 < 
(/3 — l)a ; P(Z k > jY k ^i/k s infinitely often) = 0. 

Proof. Consider the event A = {Yk/k 13 ^ 1 -^W,0<W< 00}. We know 
by Theorem 4.1 that P(A) = 1, and hence we shall assume that A occurs. 
Let A k = {Z k > ~{Y k _i/k s }. We shall show that Y^T=i P ( A k) < 00 so that 
the result will follow from the Borel-Cantelli lemma. 

P(A fc |F fc _!) = exp{-[(-y/k s + (3) a ~ F*W$-i 

-[h(( 1 /k s + (3)Y k _ 1 )-h((3Y k ^ 1 )]} 



exp<^ -/T 1 + 



ill 
k s 



where /3y fc _! < Q k < ([3 + #)y fc _i. Write, by (2.9), 



-ti{Q k )±Y k . 



h'{Q k ) 7 T7 
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where \oj~\ < £ for k>k$ with large enough k$ and e > arbitrary. Note that 
by (A. 26) ko can be chosen to depend on Y\ only. For e small enough this 
implies 

p^m.o < exp{-r + o[^YU + \o k \^n-i) 



< 



exp|-i/3 Q ~ 1 Q 7 pcy 1 A: /3 " 1 | < expi-dY^ 13 " 1 ^}. 



The next to last inequality follows from (A. 26). Hence P(A k \Yi) < exp{— dY\ x 
^-i-<5| for k>k = k (Yx). If 8 < p - 1, P(A fc |Yi) < oo, so, by the 

Borel-Cantelli lemma, conditional on Yx, P{A k i.o.|Yi) =0. But this is true 
for all Y 1 . Hence P(A k i.o.) = 0. □ 

Proof of Theorem 5.3. Let <y(k) = 1/^=1^ where d = P a . We 
write (5.2) as 

k-i _ 
(A.40) log£(e-*^ fc ) = - J>g[l + t(l + 0fe (l)) 7 (fc)e dr H- 

i=i 

Since when R is exponentially distributed with mean fi, logE(e~ tR ) = 
— log(l + fit), it is sufficient to show that the right-hand side of (A.40) con- 
verges, as k — > oo, to — Yl'jLi l°g[l + tftj]- 

First consider 

(A.41) j(k)e^-j 



where S hk = Zi=Lj e d(Y ^ Y ^ and T hk = e~ d ^~ Y ?\ Note that 

Yi = Zi + (3Y where Zi is the amount above (3Yi_i for the ith item that 
is kept. Hence, 

- _ (i-rjY^ + Z. + pY^ 
* i — 

I 

= Yi-x H — — 1 because B — 1 = 1/a. 

i 

By Lemma A.l, for all i sufficiently large 

Yf = Yf_Jl + -^ + l- 
V iY^x ax 



Yf_i ^1 + — + smaller order terms^ . 
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Let w = limfc-j.oo Yk/k l / a . Therefore, lim^oo Yf — Yf_ 1 = w a and for fixed 
b, lirrij_ i . 00 Yf +b — Yf = bw a . This implies 

Hm s jk = Hm y e m-n- i ) = iim y (AtXi-^-n-i) 

k— >oo ' k— ¥00 — ' k— >oo — ' 

i=k-j 1=0 

(A.42) 

j ~ 1 P dw a j -I 

E dlw a _ e ~ 1 

e dw a _ l ' 

For any e > there exists m such that (1 — e)w a < < (1 + e)u> a for 
all i>m. This implies 

m— 1 k—j — 1 

lim 2),* = hm y e -«-r^) + Hm y e-^-i-^f). 

k-^oo k— >oo — ' fc— >oo ^- -■' 

Fix m. Then the first limit on the right-hand side is clearly zero since Y^_j — > 
oo as k — > oo . Consider the second term 

fe— j— 1 k—j—m 



hm sup V e -^S-,-^) < hm V e -*» a (i-*) 



1 



Similarly, 

hm inf k y e-^-n) > * 

Hence, 

(A. 43) lim Tj i. = , for any fixed j. 

Substituting the results (A.42) and (A.43) into (A.41) yields 
lim ^(jzje^ k ~3 



fc^oo ,v ; (e dwa 3 - l)/(e dwa - 1) + ll(e dwa - 1) 

(A.44) 

dw° 



dw 



— =Hj for fixed j. 



Returning to (A. 40), fix re. 

fe-i 

-^log[l + t(l + 0fc (l))7(fc)e dy ^ 

3=1 

n-1 

(A.45) =-^log[l + t(l + o fc (l))7(fc)e 

3=1 



k-3 
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fc-1 

dYi 



^log[l + t(l + Ofc (l)) 7 (£0e fl 



fc-j 
j=n 



Equation (A. 44) implies that each term in the sum of the first expression on 
the right-hand side converges to log(l + t/ij) as k — > oo. We need to show that 

lim^oo X^=nl°g[l + + °k{~L))7{k)e dY k ~i] can be made arbitrarily small 
by choosing n to be sufficiently large (all terms in the sum are positive). 
Note that ^y(k)e dY k ~i < For any e > choose n large enough so that 

> (1 — e)w a for all i>n. For j > n, 

i=k-j 1=0 
J- 1 p d«; Q j(l-£) 1 

dh« Q (l- £ ) 6 1 



e dw a (l-e) _ l ' 
1=0 



Hence, 



_ p dw a (l-s) i 

^^)^-< e i. J ( 1 -.,: 1 <^°'°°-""- e) - 

Choose /c large enough so that Ofc(l) < e. Then 

fc-i 



lim Vlog[l + i(l + 0fc (l)) 7 (A;)e 



dY 



k—>oo ^ — J 
j=n 



k-i 



fc-1 

< lim yifl + eje-^^ 1 )!^ 

fc— >-oo ^ — ' 

t (l + e ) e -^ Q (n-2)(l-e) 

< — 

e dw a (l-e) _ l 

Since the right-hand side goes to zero as n — > oo the second term in the sum 
in (A. 45) can be made arbitrarily small by choosing n sufficiently large. □ 
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